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Abstract 

Two different sufficieiit conditions are given for the convergence of the 

Magnus expansion arising in the study of the Hnear differential equation 

v—' ' Y' = A(t)Y. The first one provides a bound on the convergence domain 

r^ . based on the norm of the operator A{t). The second condition hnks the 

t^ I convergence of the expansion with the structure of the spectrum of Y{t), 

thus yielding a more precise characterization. Several examples are pro- 
posed to illustrate the main issues involved and the information on the 
convergence domain provided by both conditions. 



00 

(Ti ! 1 Introduction 

The approach followed by Magnus in [21] to solve the non-autonomous linear 

differential equation 
r^ '. dY 

where Y{t) and A[t) are (sufficiently smooth real or complex) n x n matrices, 
^ ' is to express the solution Y[t) as the exponential of a certain matrix fl{t), 

^' Y{t) = expn{t). (1.2) 

By substituting (jl.2p into (jl.ip , one can derive the differential equation satisfied 
by the exponent O |15| : 

n' = dexp^\A{t)), n{0)=O, (1.3) 

where dexp^ is the inverse operator of the power series 

V^ 1 ,fc _ exp(adf7) - 1 

dexpo = > T, TTrado = \ • 

^(^ + 1)! adn 



Specifically, its expression is given by 



dexp-n^)^J]Tfad^(^)- 



fc=0 
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Here {Bk}k£Z+ are the Bernoulli numbers [T], ad is a shorthand for an iterated 
commutator, 

ad^^ = A, ad^+M = [Q, ad'^A] , 

and [J^, A] = QA — AQ. By applying Picard's iteration on (|1.3p . one gets an 
infinite series for r2(t), 

oo 

n{t) = Y,^k{t), (1.4) 



fc=i 



whose first terms read 

ni{t) = I A{ti) dti, 
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^2{t) = - I dh I dt2 [A{ti),Ait2)] (1.5) 



^3(t) = \l dh r dt^r dh {[A{ti), [A{t2), A{t^)]] + [^(ts), [A{t2), A{ti)]]) 
t> Jo Jo Jo 

Explicit formulae for Q.m of all orders have been given in [16], whereas in [19] a 
recursive procedure for the generation of any 0^ was proposed, which presents 
some advantages from a computational point of view. When this recursion is 
worked out explicitly, it is possible to express $!„ as a linear combination of 
m-fold integrals of m — 1 nested commutators containing m operators A, 

^ra{t) = ^ -f Y^ / adf^fei(s) adn,^{s) • • • adn,^,{s)^(s) ds m > 2, 

fel>l,...,fcj>l 

(1.6) 
an expression, however, that becomes increasingly intricate with m, as it should 
be already evident from the first terms (II. Sp . 

Equations (jl.2p and (|1.4p constitute the so-called Magnus expansion for the 
solution of (II. ip . whereas the infinite series (jl.4p with (jl.6p is known as the 
Magnus series. 

Since the 1960s, the Magnus expansion has been successfully applied as a 
perturbative tool in numerous areas of physics and chemistry, from atomic and 
molecular physics to nuclear magnetic resonance and quantum electrodynamics 
(see [2] and [^ for a review and a list of references). Also, since the work by Iser- 
les and N0rsett [16] , it has been used as a tool to construct practical algorithms 
for the numerical integration of equation (II. 1|) , while preserving the main quali- 
tative properties of the exact solution. In this sense, the corresponding schemes 
are prototypical examples of geometric numerical integrators |13j . 

To be more specific, suppose that A{t) belongs to some matrix Lie (sub)- 
algebra q for all t. Then the exact solution of (jl.ip evolves in the matrix Lie 
group Q having g as its corresponding Lie algebra (the tangent space at the 
identity oi Q). Observe now that all terms in the Magnus series are constructed 
as sums of multiple integrals of nested commutators, so that Q and indeed 
any approximation to it obtained by truncation will also be in the same Lie 
algebra. Finally, its exponential will be in Q. By truncating appropriately 



the series, approximating efficiently the multivariate integrals by quadratures 
and reducing the number of required commutators, it is possible to design new 
integrators based on the Magnus expansion which have proved to be highly 
competitive with other, more conventional schemes with respect to accuracy 
and computational effort in the numerical integration of p.l|) on matrix Lie 
groups [^[5|[T5]. 

Although the Magnus expansion has been formulated here only for n x n 
matrices, the same result is also valid (as least formally) in a more general 
setting. As a matter of fact, it was originally established assuming only that 
A{t) is a known function of t in an associative ring [21]. On the other hand, in 
order to apply this approach in Quantum Mechanics, it is tacitly assumed that 
the expansion is also valid when A{t) is a linear operator in a Hilbert space. 

From a mathematical point of view, it is clear that there are at least two 
different issues of paramount importance at the very basis of the Magnus ex- 
pansion: 

• First, for what values of t and for what operators A does equation (jl.ip 
admit a true exponential solution in the form ()1.2p with a certain J7(t)? 
This could be called the existence problem. 

• Second, given a certain operator A{t), for what values of t does the Mag- 
nus series ()1.4p converge? In other words, when Q{t) in (jl.2p can be ob- 
tained as the sum of the series (II. 4p ? This we describe as the convergence 
problem. 

Of course, given the relevance of the expansion, both problems have been 
extensively treated in the literature since Magnus proposed this formalism in 
1954 [21j . In section 2 we review some of the most relevant contributions 
already available regarding both aspects, whereas in the rest of the paper we 
will concentrate ourselves on the convergence issue. Thus, in section 3 we 
provide a general result on the convergence of the Magnus series which is valid 
for bounded linear operators A{t) in a Hilbert space, and not only for real 
matrices. Then, in section 4, we analyze the problem from a different point of 
view, characterizing the convergence (or divergence) of the series in terms of 
the eigenvalues of the matrix Y{t). This allows us, in some cases, to obtain 
more accurate estimates and at the same time gives us more insight into the 
convergence problem. Several examples are also considered to illustrate the 
main issues involved. Finally, section 5 contains a discussion of the results 
obtained. 

2 Existence and convergence of Q{t): previous re- 
sults 

2.1 On the existence of r2(t) 

In most cases one is interested in the case where A(t) belongs to a Lie algebra 
Q under the commutator product. In this rather general setting, Magnus result 



can be formulated as four statements concerning the solution of dY/dt = A{t)Y , 
each one more stringent than the preceding [31]. Specifically, 



(A) The differential equation dY/dt = A{t)Y has a solution of the form Y{t) = 
expr2(t). 

(B) The exponent r2(i) lies in the Lie algebra g. 

(C) The exponent Q.{t) is a continuous differentiable function of A{t) and i, 
satisfying the nonlinear differential equation dQ/dt = dexp^ {A{t)). 

(D) The operator Q(t) can be computed by a series 

where every term is a multivariate integral involving a linear combination 
of nested commutators of A evaluated at different times (i.e., the Magnus 
series (fLl) with (fLGl)). 



We proceed now to analyze in detail the conditions under which statements 
(A)-(C) hold, whereas the validity of (D) will be established by examining the 
convergence problem in the rest of the paper. 

(A) If A{t) and Y{t) are n x n matrices, from well-known general theorems on 
differential equations it is clear that the initial value problem (jl.ip always has 
a uniquely determined solution Y{t) which is continuous and has a continuous 
first derivative in any interval in which A{t) is continuous [7]. Furthermore, the 
determinant of Y is always different from zero, since 

t 



det Y{t) = exp f / tr A{s)ds j 



On the other hand, any matrix Y can be written in the form exp if and only if 
dety 7^ [m page 239], so that it is always possible to write Y{t) = expO(t). 
In the general context of Lie groups and Lie algebras, it is indeed the reg- 
ularity of the exponential map from the Lie algebra g to the Lie group Q that 
determines the global existence of an Q(t) € g [8l[29]: the exponential map of a 
complex Lie algebra is globally one to one if and only if the algebra is nilpotent. 
In general, however, the injectivity of the exponential map is only assured for 
^ € such that ||^|| < pg for a real number pg > and some norm in q |25ll26j . 

(B) Although in principle pg constitutes a sharp upper bound for the mere 
existence of the operator 0, £ q, its practical value in the case of differential 
equations is less clear. For instance, the logarithm ofY(t) may be complex even 
for real A(t) [31]. In such a situation, the solution of (|1.1|) cannot be written as 
the exponential of a matrix belonging to the Lie algebra over the field of real 
numbers. One might argue that this is indeed possible over the field of complex 
numbers, but (i) the element Q cannot be computed by the Magnus series (D), 
since it contains only real rational coefficients, and (ii) examples exist where the 
logarithm of a complex matrix does not lie in the corresponding Lie subalgebra 



It is therefore interesting to determine for which range of t a real matrix 
A{t) in (jl.ip leads to a real logarithm. This issue has been tackled in |26| in 
the context of a complete normed (Banach) algebra, proving that if 

\\A{s)\\2ds<Tr (2.1) 



then the solution of (jl.ip can be written indeed as Y(t) = expi}{t), where i}{t) 
is in the Banach algebra. In ()2.ip . ||.||2 stands specifically for the 2-norm (or 
spectral norm) of A. 

(C) In his original paper [21], Magnus was well aware that if the function i}{t) 
is assumed to be differentiable, it may not exist everywhere. In fact, he related 
the differentiability issue to the existence of the right-hand side of eq. ()1.3p and 
gave an implicit condition for an arbitrary A. More specifically, he proved the 
following result for the case of n x n matrices (Theorem V in [21] ) : 



Theorem 2.1 The equation A{t) = dexp^{Q') can be solved by Q' = dexp^ A{t) 
for an arbitrary A if and only if none of the differences between any two of the 
eigenvalues of VL equals 27rim, where m = ±1, ±2, . . ., (m 7^ 0). 

Unfortunately, such a result has not very much practical application unless we 
can easily determine the eigenvalues of O from those of A{t). 

2.2 Convergence of the Magnus series 

Let us analyze now in some detail statement (D). Magnus considered the ques- 
tion of when the series (jl.4p terminates at some finite index m, thus giving a 
globally valid fi = f^i -|- • • • -|- ^rn- This happens, for instance, if 



A{t), I A{s)ds 
Jo 







identically for all values of t, since then fi^ = for A; > 1. A sufficient (but not 
necessary) condition for the vanishing of all terms Qk with k > I is that 

[^(.i), [A{s2), [A{ss), ■■■ , [A{si),A{si+,)] •••]]]= 

for any choice of si, . . . , s^+i. In fact, the termination of the series cannot be 
established solely by consideration of the commutativity of ^(i) with itself, and 
Magnus considered an example illustrating this point [21] . 

In general, however, the Magnus series does not converge unless A is small 
in a suitable sense. Several bounds to the actual radius of convergence in terms 
of A have been obtained along the years. Most of these results can be stated 
as follows. If ^m.{i) denotes the homogeneous element with m commutators 
in the Magnus series as given by (|1.6p . then Q{t) = Y2m=i ^m{t) is absolutely 
convergent for < t < T, with 

r = maxit>0: / ||A(s)||2 ds < r^ I . (2.2) 



Thus, Pechukas and Light [28] and Karasev and Mosolova [T7] both obtained 
Tc = log 2 = 0.693147 . . . , whereas Chacon and Fonienko [6] got Tc = 0.57745 .... 
In 1998, Blanes et al. [2] and Moan [21] obtained independently the improved 
bound 



1 r^- 1 



-—d3; = ^ = 1.08686869... 
2 + 1(1 -cot I) 

Based on the analysis of some selected examples, Moan [26] concluded that, in 
order to get convergence for all real matrices A{t), necessarily Tc < tt, and more 
recently Moan and Niesen [27] have been able to prove rigorously that indeed 
Tc = vr. 

This result shows, in particular, that statement (D) is locally valid, but 
cannot be used to compute in the large. However, as we have seen, the other 
statements need not depend on the validity of (D). In particular, if (B) and (C) 
are globally valid, one can still investigate many of the properties of Q. even 
though one cannot compute it with the aid of (D). 

3 A generic result on the convergence of the Magnus 
series 

3.1 General formulation 

As we have mentioned before, if A{t) is a real n x n matrix, then (12. ip gives 
a condition for Y(t) to have a real logarithm. Moreover, it has been shown 
that, under the same condition, the Magnus series (II. 4p converges precisely to 
this logarithm, i.e., its sum ^{t) satisfies e^'*-* = Y{t) [27]. Our purpose in this 
section is provide a different proof of this property which in fact is also valid 
in the more general setting of linear operators in a Hilbert space of arbitrary 
dimension. 

To begin with, let A{t) be a bounded linear operator in a Hilbert space Ti, 
with 2 < dim 7i < oo. Let us introduce a new parameter e € C and denote by 
Y{t; e) the solution of the initial value problem 

^ = eA{t)Y, y(0) = /, (3.1) 

where now / denotes the identity operator in Ti. It is known that Y(t; e) is an 
analytic function of e for a fixed value of t. Let us introduce the set B^ C C 
characterized by the real parameter 7, 

B^ = {e£C : \e\ [ \\A{s)\\ds < 7}. 
Jo 

Here ||.|| stands for the norm defined by the inner product on TC. Our first 
statement is that, if t is fixed, the operator function Lp{e) = logy(t;e) is well 
defined in B^ when 7 is small enough, say 7 < log 2, as an analytic function of 
e. 

As a matter of fact, this is a direct consequence of the results collected in sec- 
tion l2.2t if, in particular, |e| L ||A(s)|| ds < log 2, the Magnus series correspond- 
ing to ()3.ip converges and its sum r2(t;e) satisfies e^*'^-* = Y{t;e). In other 



words, the power series il.{t;e) coincides with 99(e) when |e| J^ \\A{s)\\ ds < log 2, 
and so the Magnus series is the power series expansion of 93(e) around e = 0. 
The next theorem shows that, indeed, 7 = vr. 

Theorem 3.1 The function c/7(e) = logy(t;e) is an analytic function of e in 
the set Bt^, with 

B^ = {e£C : \e\ [ \\A{s)\\ds < vr}. 

IfTiis infinite- dimensional, the statement holds true if Y is a normal operator. 

The proof of this theorem is based on some elementary properties of the 
unit sphere S^ in a Hilbert space. Let us define the angle between any two 
vectors x / 0, y / in "H, Angja::, y} = a, < a < vr, from 

Re(x,y) 

cos a = - — — — - , 
\\x\\ \\v\\ 

where (•, •) is the inner product on TC. This angle is a metric in 5^, i.e., the 
triangle inequality holds there. A trivial property which will be used in the 
sequel is the following: if ||x|| = 1 and ||u|| < 1/2, then Ang{x + u,x} < 
||n||(l + ||u||2). 

The second basic property of the angle we need is given by the following 
lemma, whose proof (due to Moan [2B]) is included here for completeness. 

Lemma 3.2 (Moan). For any x ^ inH, Ang{Y{t;e)x,x} < \e\ f^ \\A{s)\\ds 

Proof of Lemma \3.2[ Let y^ = x and consider the vector y(t) = Y{t; e)yo 
satisfying the initial value problem y' = eA{t)y, y(0) = yo- Then, clearly, 
lly'll < klll^(OII lly|l- Let y{t) = ipWiT denote the unit vector in the direction 

of y{t), so that y' = -^ y + ||y||y'. On the other hand, since {y,y) = 1, then 
{y',y) + {y,y') = Re(y, y') = 0, i.e., y and y' are orthogonal. In consequence, 

/ / f\ II /||2 fd\\y\\Y 2||-^/||2 

{y ,y) = \\y II = [—^) + H^H H^ II > 

whence, by discarding the (||y||')^ term, ||y'|p ||y|p < ||y'|P, and thus ||y'|| ||y|| < 
lly'll < |e|||^(OII lly|l o'^ simply ||y'|| < |e|||^(t)||. Integrating this last inequality 
we get 



/ \\y'{s)\\ds<\e\ [ \\A{.s)\\ds, 
Jo Jo 



but 



\y'{s)\\ds= / VWW¥W)ds 
'0 Jo 

is the length (defined through the metric given by the inner product) of the 

curve traced by the unit vector y{s) when s G [0,t] on the unit sphere S^, 

which is greater than or equal to Ang{y(t),yo}) and this proves the result. □ 

Observe that if y is a normal operator in TC, i.e., YY* = Y*Y, where 
Y* denotes the adjoint operator of Y (in particular, if Y is unitary), then 
||y*rE|| = ||yx|| for all X G TC and therefore Ang{y*x,x} = Ang{yx,a;}. 

The following lemma provides useful information on the location of the 
eigenvalues of a given bounded linear operator in TC [23] . 



Lemma 3.3 (Mityagin). LetT he a (hounded) operator onTC. IfAng{Tx,x} < 
7 and Ang{T*j;, 2;} < 7 for any x ^ 0, x £ Ti, where T* denotes the adjoint 
operator of T, then the spectrum ofT, (j{T), is contained in the set 

A^ = {z= Izle''^ EC: |wj < 7} 

Proof of Lenima \3.3[ Without loss of generality, we may assume 7 < vr (if 7 > vr, 
there is no statement here). If dim 7^ < 00, only the first requirement on T, 
AngjTx, x} < 7 is sufficient, since in that case, if A = |A| e*"^ 7^ 0, — vr < lo < tt, 
is in cr{T), then there exists some / 7^ such that Tf = Xf and 

Ang{T/,/} = Ang{A/,/} = |a;|<7. 

If, on the other hand, dim Ti = 00 and A G <y{T), A 7^ 0, then, as is well known, 
either (i) A belongs to the approximate spectrum of T, aap{T), or (ii) A is in 
the residual spectrum, ar{T) [H]. 

(i) In the first case, there is a sequence {/«} in 7i such that ||/„|| = 1 for all n 
and lim„^oo \\(T - XI)fn\\ =0. Equivalently, r/„ = A/„, + e„, with ||e„|| ^0 
when n ^ 00. Then we have 

7 > Ang{r/„, fn} = Ang{A/„ + e„, /„} > Ang{A/„, /„} - Ang{A/„, Xfn + e^} 

(3.2) 
since the angle is a metric in S^. Now, as A = |A| e*"^ 7^ 0, — vr < cj < vr, it is 
clear from (13.21) that 



7 > |a;| - Ang{/„, /„ + jSn] > |w| - ^ (1 + ^^) , (3.3) 

where the last inequality holds when -feJ^ < i i.e., for sufficiently large n. 

Taking the limit n — j- 00 in (j3.3p leads to 7 > |a;|. 

(ii) If A G ar{T), then A is an eigenvalue of T*, i.e., Ker (T* - XI) / {0} [H]. 

Since, by assumption, Ang{T*x, x} < j for all x 7^ 0, we can apply again the 

argument in (i) to T*, A and conclude that |a;| < 7. □ 

Now we are ready to prove the main theorem. 

Proof of Theorem \3.1\ Let us introduce the operator T = Y{t] e), with e € B^, 

7 < vr. Then by Lemma [3. 21 AngjTx, x} < 7 for all x 7^ 0, and thus, by Lemma 



a{T) C A^. (3.4) 

If dim Ti = CO and we assume that Y{t\ e) is a normal operator, then (j3.4|) also 
holds. 

From equation (|3.1|) in integral form. 



y(t;e) =I + £ I A{s)Yds, 
Jo 



one gets ||y|| < 1 + |e| /q ||^(s)|| ||y||ds, and application of Gronwall's lemma 
[T2] leads to 

||y(t;e)||<exp(^|e|^ \\A{s)\\ds' 



An analogous reasoning for the inverse operator also proves that 
\\Y-\t;e)\\<exp(^\e\J^ P(5)Ms) ■ 

In consequence, 

||r||<e'^ and ||r-^|| < e"^. 

If A 7^ G a{T), then jA] < ||r|| [H] and therefore |A| < e"^. In addition, 
J € cr{T~^), so that |A| > e"'^. Equivalently, 

a{T) c{zeC:e-^'< \z\ < e^} = G^. (3.5) 

Putting together ()3.4p and (jS.Sp . one has 

o-(r) c G-y n A^ = A^. 

Now choose any value 7' such that 7 < 7' < vr (e.g., 7' = (7 + 7r)/2) and 
consider the closed curve F = dAy. Notice that the curve T encloses a{T) in 
its interior, so that it is possible to define the function if{e) = logy(t; e) by the 
equation [9] 

(^(e) = -L /■ log ^ (zl - Y{t; e))"' dz, (3.6) 

where the integration along F is performed in the counterclockwise direction. 
As is well known, (j3.6p defines an analytic function of e in B^i [9] and the result 
of the theorem follows. □ 

Theorem 3.4 Let us consider the differential equation Y' = A[t)Y defined in 
a Hilbert space TC with Y{0) = I, and let A{t) be a bounded linear operator 
on Ti. Then, the Magnus series Q{t) = X^^i^A:(0; with Q^ given by lil.b]) 
converges in the interval t G [0, T) such that 

T 

\\A{s)\\ds<T: 


and the sum i}{t) satisfies expQ{t) = Y{t). The statement also holds when 7i is 
infinite- dimensional ifY is a normal operator (in particular, ifY is unitary). 

Proof. Theorem 13.11 shows that \ogY{t:,e) = (p{e) is a well defined and analytic 

function of e for 

ft 



/ \\A{s)\\ds <Tr. 
Jo 



It has also been shown that the Magnus series 0(i;e) = '^'^Li^^^ki't), with 
ilfc given by (jl.6p . is absolutely convergent when \e\ j^ \\A{,s)\\ds < S, = 1.0868... 
and its sum satisfies exp ri(t; e) = Y{f e). Hence, the Magnus series is the power 
series of the analytic function 99(e) in the disk |e| < CZ/n ll^('5)IM'5- But v'(e) 
is analytic in B-j^ D B^ and the power series has to be unique. In consequence, 
the power series of 93(e) in B^^ has to be same as the power series of (^(e) in 
B^, which is precisely the Magnus series. Finally, by taking e = 1 we get the 
desired result. □ 



3.2 Examples 

Theorem 13.41 provides thus sufficient conditions for the convergence of the Mag- 
nus series based on an estimate by the norm of the operator A. In particular, 
it guarantees that the operator Q(i) in Y{t) = expJ7(t) can safely be obtained 
with the convergent series X]fc>i ^fc(0 fo'^ < t < T when the terms f^fc(i) are 
computed with (jl.6p . A natural question arising here is the following: is the 
bound estimate provided by Theorem 13.41 sharp or is there still room for im- 
provement? In order to clarify this issue, we next analyze two simple examples 
involving 2x2 matrices. 

Example 1. Moan and Niesen [27] consider the initial value problem (ll.ip 
with 

A{t) = (^l _iy (3.7) 

If we introduce, as before, the complex parameter e in the problem, the corre- 
sponding exact solution Y(t; e) of (|3.ip is given by 

/ p2et J_ 2et _ M i U\ p-et \ 

ni;£) = (o '^ 1%^'^ ) (3.8) 

and therefore 

, ,,, , / 2t g{t]e)\ ., , , t(l - e^^* + 3ei) 

\ogY{t-e)= „ ^^\' , with g{t-e)- 



-t 7 ' ^' ' ' 3(1 -e3^*) ■ 

The Magnus series can be obtained by computing the Taylor expansion of 
logy(t;e) around e = 0. Notice that the function g has a singularity when 
et = -j^i, and thus, by taking e = 1, the Magnus series only converges up to i = 
Ivr. On the other hand, condition j^ \\A{s)\\ds < vr leads to T « 1.43205 < |7r. 
In consequence, the actual convergence domain of the Magnus series is larger 
than the estimate provided by Theorem 13.41 

Example 2. Let us introduce the matrices 

and define 

' f3X2 < t < 1 



^^*) ' aXi t>l 

with a, (5 complex constants. Then, the solution of equation (jl.ip at t = 2 is 
y(2) =e"^ie^^2^ so that 

0(2) = log(e-^ie'^^2) = aXi + -^^ X^, (3.10) 



1-e- 



an analytic function if \a\ < vr with first singularities at a = ibivr. 

On the other hand, a simple calculation with the recurrence ()1.6p shows 
that 



oo oo on-1 



0(2) = Y^ 0,(2) = aX, + [3X2 + ^(-l)"'^ V «""'/3^2. (3.11) 

fe=l n=2 ^^ ^' 

10 



Comparing with expression (j3.10|) . it is clear that the Magnus series cannot 
converge at t = 2 if |q;| > vr, independently of /3 / 0. 

If we take the spectral norm, then \\Xi\\ = \\X2\\ = 1 and 

\\Ais)\\ds = \a\ + \Pl 



so that the convergence domain provided by Theorem 13.41 is \a\ + |/3| < vr for 
this example. 

From the analysis of Examples 1 and 2 we can conclude the following. First, 
the convergence domain of the Magnus series provided by Theorem 13.41 is the 
best result one can get for a generic bounded operator A(t) in a Hilbert space, 
in the sense that one may consider specific A{t), as in Example 2, where the 
series diverges for any time t such that /„ |j^(s)|[(is > vr. Second, there are also 
situations (as in Example 1) where the bound estimate Tc = vr is still rather 
conservative: the Magnus series converges indeed for a larger time interval than 
that given by Theorem \3.4\ This is particularly evident if one considers equation 
(|3.1|) with A{t) a diagonal matrix, 

■^C) - ( "f J(, ) ■ P^'^) 

Then, the exact solution Y{t; e) of (13. ip is a diagonal matrix whose elements are 
non- vanishing entire functions of e, and obviously logY{t;e) is also an entire 
function of e. In such circumstances, the convergence domain \e\ L \\A{s)\\ds < 
IT for the Magnus series does not make much sense. 

4 Another characterization of the convergence of the 
Magnus series 

4.1 Main result on convergence 

The examples collected in the preceding section (and many more one can build) 
clearly show that, although the condition L \\A{s)\\ds < vr is sharp (in the 
sense that the constant vr is the largest number for which Theorem 13.41 holds in 
general), it is certainly not necessary for the convergence of the Magnus series. 
Thus, it would be highly desirable to have a more realistic criterion which give 
both necessary and sufficient conditions for convergence. 

In [27], a conjecture is formulated, relating the convergence of the Magnus 
series with the eigenvalues of the exact solution Y{t;e). Here we state a theo- 
rem which, on the one hand, explains the phenomena observed by Moan and 
Niesen |27] and, on the other hand, provides a new tool to determine the actual 
convergence domain of the Magnus series in some physically relevant examples 
and applications. 

The main result in this section (Theorem 14. 2p is valid for complex n x n 
matrices and is based on the theory of analytic matrix functions, in particular, in 
the logarithm of an analytic matrix function. In fact, it is a direct consequence 
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of the analysis done in [321 Chapter 1, section 3]. Here we shall summarize the 
most relevant aspects of the formalism and refer the reader to [32] for a more 
detailed treatment (including proofs). 

Our starting point is again the initial value problem Y' = eA{t)Y , Y{Q) = /, 
where now A[t) and Y are (complex) n x n matrices and e G C. If we denote 
by Yt(e) the exact solution for a fixed value of t, Yt{e) = Y{t; e), it is clear that 
Yt{e) is an analytic function of e [7], since the Neumann series 

yi(e)=/ + VeM dti^(ti) / dt2A{t2)--- dtkA{tk) 

j^^l JO Jo Jo 

converges provided that Jq ||^(s)||(is < oo. In addition, detYt{e) ^ for all e. 
Under these conditions, it has been shown that the matrix 0,t{e) = log It (e) 
is also an analytic function of e at e = 0. In other words, the series r2t(e) = 
'^k>i^^t,k (i-6-, the Magnus series) is convergent for sufficiently small e. The 
goal is then to determine the actual radius of convergence r of this series. 

Let us denote hy pi{e), . . . , Pn{£) the eigenvalues of the matrix Yt{e). Notice 
that Yt{0) = I, so that pi(0) = • • • = /3„(0) = 1. It is therefore natural to take 
the principal values of the logarithm, logpi(O) = • • • = log/3„(0) = 0, as this 
choice is consistent with the series f^i(e). 

Let L be a curve on the e plane in the disk |e| < ro < oo issuing from the 
origin. Recall that the matrix Yt{£) is analytic in the disk |e| < rg. On the 
curve L it is possible to define a unique function log pj{e), j = l,...,n, by 
continuity, given the values logpj{0) = 0. 

Let po be a multiple eigenvalue of yj(eo) for some eo with |eo| < '''o with 
multiplicity I. If we reorder the eigenvalues of Yj (eq) in such a way that the first I 
are precisely po, it is clear that the numbers logpi{eo), logp2{so), ■ ■ ■ Aogpii^o), 
1 < I < n, are congruent modulo 27ri and are such that pi{£o) = • • • = pi{£o) = 
Pq. Associated with this multiple eigenvalue po there is a pair of integers {p,q) 
defined as follows. 

The integer p is the greatest number of equal terms in the set of numbers 
log/9i(eo), logp2(eo), ■■■,^ogpi{eo) such that pk{£o) = po, k = 1, . . . ,1. 

The integer q is the maximum degree of the elementary divisors {p — po)^ 
of It(eo)) i-e., the maximum dimension of the elementary Jordan block corre- 
sponding to Po- 

Notice that the numbers / and q depend only on the particular eigenvalue 
Po, whereas the integer p depends on pQ and the curve L. 

Under these conditions, it is possible to prove the following lemma |321 page 
64] on the convergence of the series ^^{(e). 

Lemma 4.1 (Yakubovich-Starzhinskii) . Suppose that the series 

ili(e) = '^k>i^^^t,k satisfies that expili(e) = lt(e) for sufficiently small \e\. 

Then 

(a) If r < ro is the radius of convergence of the series f^i(e), the eigenvalues 
Ai(e), . . . , An(e) of the matrix rit(e), defined for |e| < r, can be defined by 
continuity on the circle \e\ = r, and there exists a point Eq such that for 
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some j,k = 1, . . . ,n 

Xjieo) - Afc(eo) = 27rim, 

where m ^ is an integer. 

(h) Suppose that Eq is the value of e of smallest absolute value (eq ^ 0, \eo\ < 
roj such that the matrix Yt{Eo) has an igenvalue pQ of multiplicity I > 1. 
Suppose that there is at least one such an eigenvalue pQ and at least one 
curve in the disk \e\ < \eo\ joining the origin e = with the point E = Eq 
such that p < q, where the integers p and q have been defined before. Then 
r = \eo\ is the radius of convergence of the series ^t{^) = Ylk>i^''^t,k- 

In order to apply this result one first has to solve the equation 

A(e) = 0, (4.1) 

where A(e) denotes the discriminant of the characteristic polynomial det(l^(e) — 
pi). We recall here that the discriminant of a polynomial 

p{x) = anX^ + On-ix"^'^ H h ttiX + Oq 



is given by 



2n-2TT/„ „ \2 



with ri, . . . , r„ complex roots of p{x), so that it vanishes if and only if p{x) has 
one or more multiple roots [5D]. Thus, it can be used to test for the presence 
of multiple roots, without having to actually compute the roots oi p{x). 

We write the solutions of equation (j4.1|) in order of non-decreasing absolute 
value, 

4^\4^4^--- (4.2) 

and consider the circle \e\ = \eq \ in the complex e-plane. Let pg denote an 
eigenvalue of ^(eg ) with multiplicity li > I. Let e move along some fixed 
curve L from e = to e = Eq in the circle \e\ < \eq \- Then it is clear that 
h eigenvalues pj (e) will tend to Pq at e = Eq . If these points lie at e = Eq 
on the same sheet of the Riemann surface of the function log z, and this is true 
for all (possible) multiple eigenvalues of li(e) at e = eg , then eg is called 
a extraneous root of equation (j4.ip . Otherwise, Eq is called a non- extraneous 
root. 

Now, by Lemma 14.11 when |e| < \eq |, the series for rit(e) is convergent, 
so that the numbers logpj(e) are uniquely determined up to multiplicity as 
eigenvalues of the matrix 0,t{E). 

If eg is an extraneous root, there is no obstacle to the convergence of 



the series and thus we proceed to the next value in the sequence (j4.2p . We 
continue this classification until a non-extraneous root is obtained. Assume, for 

(2) 

simplicity, that Eq is the first non-extraneous root. 

(2) 

The root Eq will generally correspond to some multiple eigenvalue po of 
Yt{£Q '), with integers {p,q) as before. Then the statement of Lemma |4. II can 
be formulated as follows. 
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Theorem 4.2 If r ^ oo is the radius of convergence of the series 

oo 

Vitie) = Y.e'' nt,k, (4.3) 



fc=i 



there is at least one non- extraneous root £q of the equation A(e) = on the 
circle \e\ = r. If for this root one has p < q for some corresponding eigenvalue 
Pq of multiplicity I > 1, then r = |eo|, i.e., the radius of convergence of the 
series r2i(e) is precisely |eo|- 



We should remark here that in some cases with p > q, the series (I4.3P may well 
converge at e = Eq and the radius of convergence r is indeed greater than |eo|- 
This occurs, for instance, when A{t) is diagonal. To illustrate this phenomenon, 
consider again the matrix (|3.12|) with ai(i) = 02 (t). Then, clearly, pi{s) = /52(e) 
for all £, so that / = 2 and q = 1. If we choose logpi(e) = log /72(e)) then 
p = 2>q. 

Although these cases are in a certain sense exceptional, as explained in |321 
page 66], Theorem 14.21 is not yet, strictly speaking, a necessary condition for 
the convergence of the series ()4.3p . In any case, the convergence in the diagonal 
case is compatible with its formulation, as we have seen. 

4.2 Examples 

We next illustrate Theorem 14.21 on three different examples. We first consider 
those analyzed in subsection 13.21 and then we treat in some detail the Magnus 
expansion applied to the evolution operator describing a two-level quantum 
system. 

Example 1 (revisited). Given the exact solution (j3.8p of Example 1 in subsec- 
tion [321 the corresponding discriminant is given by A(e) = (e^'^^-l-e"^*)^ — 4e''*, 
whose roots are 

(1) n A (2) -^^ 

£h = 0, and e\ = ^- 



3t 

The first value, Eq =0, is clearly an extraneous root, so we analyze Eq . As 

(2) 
e varies along the imaginary axis from e = to e = Eq , the eigenvalues of the 

matrix Yt{£), 

Pi{e) = e^'\ P2(e)=e-^* 

move along the unit circle, one clockwise and the other counterclockwise from 

pi,2(0) = 1 to Pl,2(eS)'^) = e^'-/3 ^ ^-i2V3 ^ gi(4V3-2.)_ 
(2) (2) 

Thus, Pi(£q ) and p2{£o ) he on different sheets of the Riemann surface of the 

(2) 
function log 2; and therefore Eq is a non-extraneous root, with p = 1. Since 

Yt{£Q ) 7^ pi, we have q = 2, so that, according to Theorem 14.21 the radius of 
convergence of the series (|4.3p is precisely 

-|e?^l = |. (4.4) 
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To get the actual convergence domain of the corresponding Magnus expansion 
we have to take e = 1, and so, from (j4.4p . we get 27r/(3t) = 1, or equivalently 
t = 27r/3, i.e., the result achieved from the analysis of the exact solution in 
subsection 13.21 



Example 2 (revisited). Let us obtain the convergence domain for the Magnus 
expansion of the solution to the initial value problem Y' = eA{t)Y, Y(0) = I 
when A{t) is the piece-wise continuous matrix defined in Example 2 (subsection 
13. 2p . The exact solution for t > 1 is given by 



I [t; e J — ^ ^ ^—£w 



e" 



where w = a{t — 1). Equation (|4.ip leads in this case to cosh'^{ew) — 1 = 0, 

with first solutions 

e = 0, e = ±i—. 
w 

Again, e = is an extraneous root, whereas the eigenvalues of the matrix Yt[e) 
move along the unit circle, one clockwise and the other counterclockwise from 

yOi,2(0) = 1 to pi,2{iT^/w) = -1 

when e varies along the imaginary axis from e = to e = m/w (the same 
considerations apply to the case e = —m/w). Then, obviously, p = 1 and 
q = 2, so that the radius of convergence of the series ()4.3p is 

, , TT TT 



\w\ |a|(t — 1) 

If we now fix e = 1, we get the actual t-domain of convergence of the Magnus 
series (|1.4p as 

t = l + —. 
\a\ 

Observe that, when t = 2, we get |a| = vr and the result of subsection 13.21 is 
recovered: the Magnus series converges only for |a| < vr. 

Example 3. Our final illustration corresponds to the quantum mechanical 

treatment of a two-level system in a rotating field. It is described by the Hamil- 

tonian 

1 

— / 
2 

where Gx-, cry, az are Pauli matrices, and /? is a coupling constant. In fact, this 
system constitutes a truncation in state space of a more general one, namely an 
atom or freely rotating molecule in a circularly polarized radiation field |3m \TE\ . 
It has been previously established that when t = Itx Jlo the Magnus expan- 
sion of the corresponding evolution operator t/(t), solution of the Schrodinger 
equation 

ih^ = H{t)U, C/(0) = / (4.6) 

at 
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H{t) = -huJoaz + f3{(7x cos cot + ay sintot), (4.5) 



converges for uj > ujq and diverges otherwise [TOl EOl [22]. Several different 
arguments have been offered trying to explain this phenomenon [18]. Here we 
show that this bound can be directly provided by Theorem 14. 2[ 

The exact time-evolution operator can be obtained in closed form by trans- 
forming into a rotating frame. Replacing, as usual, H by eH in (|4.6|) one has 

U{t) = exp ( --wta^ j exp ( -ity-{euJo - w)^^ + e-ro-xj j • (4.7) 

From ()4.7p . a lengthy but straightforward calculation allows us to write the 
corresponding matrix Y{t; e) = U{t) in the form 



e 2 COS ^T- — «^ sm 



) 




^^ y -ie 2^*^ II sin f e^^*^ (cos f + i| sin f ) ' ^'' 

with 5 = eujo — uj and cb = {6'^ + 4/3^e^/?i^)^/^. Denoting 

wi tDt 6 ujt Cot 

cos Y = cos — COS sin — sin — 

2 2 cD 2 2 

= - 1 + ^ COS ^ + 1-^ cos ^ , 

2 u; 2 2 u; 2 

the eigenvalues of Yt{e) can be expressed as 



Pi,2{e) = cos X ± Vcos^x- 1, (4.9) 

so that U{t) has multiple eigenvalues when cos x = il- This equality is satisfied 
by e = 0, which is clearly an extraneous root. The remaining roots of equation 
(|4.1|) are obtained from 

f Ujt Ujt 8 . Ujt . Ujt\ /A-,r,\ 

arccos cos — cos sm — sm — = vr. (4.10) 

V 2 2 w 2 2 y ^ ' 

To simplify the discussion, let us consider the perturbative approximation [3 <C 

h\5\/2. ThentD ~ \5\ and cos x ~ cos(eu;oi/2), so that ()4.10p reduces to ea;ot/2 = 

TT. The solution 

2-K 
£0 = —7 

iOQt 

is a non-extraneous root with p = 1 and q = 2 {pi.2{£o) = — l), and thus the 
radius of convergence of the series (j4.3p is precisely |eo|- Taking now e = 1, we 
get finally the t-domain of convergence of the Magnus expansion tc = 2-11/100. 
Notice that for t = tc and to < loq we are outside the convergence disk, and thus 
the Magnus series diverges, just as noted in |18j . 



5 Discussion 

The Magnus expansion was originally designed by requiring only that A{t) be 
a linear operator depending on a real variable t in an associative ring and that 
"certain unspecified conditions of convergence be satisfied" [21]. The idea was 
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to define, in terms of A, an operator 0,{t) such tliat the solution of the initial 
value problem 

^ = A{t)Y, y(o) = /, 

for a second operator Y is given as y = exp 0. The proposed expression for 
n was an infinite series satisfying the condition that "its partial sums become 
Hermitian after multiplication by i if iA is a Hermitian operator" [21]. The 
simplest example of an equation of this type is given by a finite system of 
linear differential equations. In this case, A{t) is the coefficients matrix of the 
system, and the existence of il. is assured "for sufficiently small values of t" 
|21) . Theorem 12.11 yields an implicit condition in terms of the eigenvalues of the 
matrix Q. 

Given the importance of the expansion, it has been rediscovered a number 
of times in different settings along the years. Also a particular attention has 
been payed to its convergence in the matrix case, and so several bounds on the 
actual radius of convergence of the form ()2.2p have been obtained with different 
values of re- Recently, it has been shown that the optimal value is Tc = vr for 
all n X n real matrices A{t) |27j . 

In this work, by applying standard techniques of complex analysis and some 
elementary properties of the unit sphere, we have generalized this result to 
bounded linear operators in a Hilbert space (Theorem 13. 4p . in the spirit of the 
original Magnus formulation in the context of Quantum Mechanics. Obviously, 
this theorem is also valid for finite dimensional complex matrices. In our treat- 
ment, a complex parameter e is introduced in the formalism, so that the initial 
value problem ()3.ip is considered instead. Notice that the Magnus expansion is 
trivially recovered as soon as we fix e = 1. 

Although Theorem 13.41 provides the optimal convergence domain, in the 
sense that vr is the largest constant for which the result holds without any 
further restrictions on the operator A{t), one can easily construct examples 
showing that L \\A{s)\\ds < vr is not necessary for the convergence of the 
expansion. 

With the aim of obtaining a more precise characterization of the conver- 
gence, we have considered in section 4 the case of n x n complex matrices. 
There, as a straightforward consequence of the theory of analytic matrix func- 
tions, and in particular, of the logarithm of an analytic matrix function such 
as is done in [32], we have established a connection between the convergence of 
the Magnus series and the existence of multiple eigenvalues of the fundamental 
matrix Y{t;e) for a fixed t, denoted by Yt{e) (Theorem 14. 2p . In essence, if the 
analytic matrix function l^(e) has an eigenvalue poi^o) of multiplicity I > 1 for 
a certain eq such that: (a) there is a curve in the e-plane joining e = with 
e = Eq, and (b) the number of equal terms in logpi{eo), logp2{so), ■ ■ ■ ,logpi{eo) 
such that pki^o) = POi k = 1, . . . ,1 is less than the maximum dimension of the 
elementary Jordan block corresponding to po, then the radius of convergence of 
the series ilj(e) = J2k>i^'^^t,k verifying ex.pQt{£) = Yt{e) is precisely r = |eo|- 

This value r in general will be different for each particular t considered, so 
that we can write r = |eo| = F'it) for a given function F{t). In particular, for 

l'n*) = M^andF(i) = S' 



the examples considered in section 4, F{t) = ^, F{t) = \a \ u_i) and F{t) = ^, 
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respectively. If we fix |eo| = 1, then the convergence t-doniain of the Magnus 
expansion is obtained as the solution of F(t) = 1 with the smallest absolute 
value. 

It is interesting at this point to discuss Theorem 12.11 on the existence of a 
differentiable function 0,{t) and Theorem l3.4l in view of the more precise account 
on the convergence issue provided by Theorem 14. 2 [ First, note that under the 
assumptions of Theorem 13.41 all the eigenvalues of Y(t) lie in the region 

G^ = {z = \z\e''^ € C : e-'^ < \z\ < e^ , \oj\ < vr}, 

so that automatically all the differences between any two of the eigenvalues of 
Q{t) = log Y{t) is less than l-rri and thus Theorem 12.11 holds. Second, if all 
eigenvalues of Yt{£) are located in Gjr, the (possible) multiple eigenvalues take 
place only at extraneous roots of the parameter e and thus, according with 
Theorem 14.21 the convergence of the series is assured. 

One might think that the practical application of Theorem 14.21 to realistic 
problems is doubtful, since it is necessary to compute in advance the funda- 
mental matrix y(i;e). In this sense, the alternative (but more conservative) 
estimate provided by Theorem 13.41 directlv in terms of the operator A(t) is cer- 
tainly easier to check in practice. In our opinion, however, the characterization 
of the convergence of the Magnus expansion in terms of the multiple eigenvalues 
of y(t; e) sheds new light on this issue, has a theoretical interest by itself and, 
in addition, provides a rigurous justification for the conjecture formulated in 
|27| on the basis of the exploration of several examples. 
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